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Let Xn = (xij) be an n by p data matrix, where the n rows form a 
0^ ' random sample of size n from a certain p-dimensional population dis- 

tribution. Let R„ — {pij)he the pxp sample correlation matrix of Xn ; 
that is, the entry pij is the usual Pearson's correlation coefficient be- 
tween the ith column of Xn and jth column of Xn- For contemporary 
data both n and p are large. When the population is a multivariate 
,S^ ' normal we study the test that Ho : the p variates of the population 

^ I are uncorrelated. A test statistic is chosen as L„ =maxi^j \Pij\- The 

asymptotic distribution of L„ is derived by using the Chen-Stein 
Poisson approximation method. Similar results for the non- Gaussian 
case are also derived. 

.^ ' 1. Introduction. Let X„ = (xij) be an n by p data matrix, where the n 

00 ' rows are observations from a certain multivariate distribution and each of 

, p columns is an n observation from a variable of the population distribu- 

^— V ' tion. Let pij be the Pearson correlation coefficient between the ith and jth 

■^ . columns of Xn- That is 



/I i\ /)••— l^k=l\^k,i Xi){Xk,j Xj) 



J2k=l{^k,i — Xi) ■ •\/Z]fc=l(^fc 



2 . . hrV- (^, . _ !T-.\2 



where Xi = (1/n) X]fc=i ^fc,i- Then i?„ := {pij) is a p by p symmetric matrix. 
It is called the sample correlation matrix generated by Xn- 
k><( \ Suppose the population is a multivariate normal distribution with mean 

;_j ■ vector //, covariance matrix 5] and correlation coefficient matrix R. When 

the sample size n and the dimension p are large and comparable, Johnstone 
[14] studied the test with null hypothesis Hq : XI = I under assumption that 
/i = 0, where I is the identity matrix. The null hypothesis is equivalent 
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2 T. JIANG 

to that the population distribution is the product of p univariate standard 
normal distributions. The test statistic is chosen as the maximum eigenvalue 
of the sample covariance matrix X'^Xn according to the method principal 
component analysis (PC A). It is proved that the asymptotic distribution of 
the maximum eigenvalue is the Tracy- Widom law. 

When both n and p are large we consider the test with null hypothesis 

(1.2) Fo:R = /. 

Equivalently, the population distribution is a product of univariate normal 
distribution N{iJ,i,af)^s for some unknown //j's and unknown itj's. The dif- 
ference between this test and the one in [14] mentioned above is that all m's 
do not have to be identical and all o"j's do not have to be identical, either. 
Besides, we do not assume that /ij's and Ui's are known. Our test seems to 
be more natural and practical. The maximum eigenvalue Amax of the sample 
correlation matrix i?„ can be taken as the test statistic according to PCA. 
But the distribution of Amax is not clear so far, although there is evidence 
that Amax niay also follow the Tracy-Widom law asymptotically as shown 
in [13]. 

In this paper we do not pursue the maximum eigenvalue Amax as the test 
statistic because of its complexity. Instead we choose the following intuitive 
one: 

L„= max \pij\, 

l<i<j<p 

where pij is as in (1.1). Barbour and Eagleson [6] provided a general idea of 
dealing with the tail of L„ by using the Poisson approximation method. In 
this paper we will derive the strong law and limiting distribution of L„ via 
this method. In fact, we will prove more general results; the observations 
Xjj's do not have to be Gaussian. Our results will be precisely stated next. 
Suppose {^,Xij, i,j = 1,2,...} are i.i.d. random variables. Let Xn = 
{^ij)i<i<n,i<j<p- Let xi,X2,- ■ ■ ,Xp be the p columns of X„. Then X„ = 
(xi,X2, . . . ,Xp). Let Xk be the sample average of x^, that is, Xk = i^/n)J27=i^ik- 
We write Xi — Xi for Xj — x^e, where e = (1, 1, . . . , 1)-^ G W^. Then, pij, the 
Pearson correlation coefficient in (1.1) between Xj and Xj can be rewritten 
as 

(1-3) P^, = P^^^P^^^^,. l<i,J<P, 

J/J 'i 7 7 

where || • || is the usual Euclidean norm. Obviously, pa = 1 for each i. 
First, we obtain a strong limit theorem as follows. 

Theorem 1.1. Suppose i?|^|'^°~^ < oo for any e > 0. Ifn/p^ 7 G (0,oo), 
then 



lim ./- Ln = 2 a.s. 

n^oo \l [ogn 
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The above strong law of L„ does not depend on p although Xn is an n 
by p matrix. For the limiting distribution the following holds. 

Theorem 1.2. Suppose that E|^p°+^ < cx) for some e > 0. // n/p -^ 7, 
then 

P{nLl - 41ogn + log(logn) <y)^ ^~Ke~v/^ 
as n^ 00 for any y G M, where K = (7^ vSvr )~^ . 

The limiting distribution appearing in Theorem 1.2 is called the extreme 
distribution of type I. 

For constants a^ G M^ and 6^ G R-^, i = 1, 2, . . . ,p, it is easy to see that 
the matrix (aixi + 616,02X2 + 62^, • . • ,apXp + hpe) and X„ = (xi,X2, • • • ,Xp) 
generate the same correlation matrix Rn- Also, if ^ ~ -^(0, 1), then Ee*^ < 
00 for all t < 1/2. We immediately have the following result. 

Corollary 1.1. Suppose {xij;i > l,j > 1} are independent and xij ~ 
N{fij,a'^) for some fij and Uj 7^ for all i and j. Let the sample correla- 
tion matrix Rn be obtained from, X„ := (xjj; 1 <i <n,l < j < p). Then the 
conclusions of Theorems 1.1 and 1.2 also hold. 

The above corollary gives the distribution of the test statistic L„ under 
the null hypothesis in (1.2). 

Theorem 1.3 below is used in the proof of Lemmas 3.1 and 3.2. These two 
lemmas are key to prove Theorems 1.1 and 1.2. It is a nonasymptotic in- 
equality on the moderate deviation of partial sums of independent random 
variables. Though sums of independent random variables are well under- 
stood, we did not notice a similar result in the literature, for example, [19] 
and [20]. The usual moderate results such as those in [15] and Theorem 3.7.1 
on page 109 from [8] are not applicable in our case. The reason is that we 
do not have identical distribution assumption. Second, asymptotic bounds 
do not work in our proof because our case involves an uniform bound of 
infinitely many such probabilities. This is evident from Lemma 2.1 in Sec- 
tion 3. There is a similar situation in the large deviation case. The Chernoff 
bound (see, e.g., (c) of Remarks on page 27 from [8]) is a nonasymptotic 
bound of sums of i.i.d. random variables. But the classical Cramer-type 
large deviation is a limiting result. The Chernoff bound is used in the proof 
of theorems in [10] and [11] for the same reason of proving our Theorems 
1.1 and 1.2 via the following Theorem 1.3. 

Theorem 1.3. Let {i]i, 1 <i <n} be independent random variables with 
mean zero. Assume maxi<j<n E\Vi\^ < 00 for some /3 > 2. Then for any p> 
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and t> 0, 
(1.4) 

where 

1 



f(^S->-')^^-'''--"'"'"'' 



Ms := - V£'|r/j|* /or s G (0,/5] and ii'„:=exp< 
n ^ 



i=l 



and tp = t- MfsnP^^-f^^+^^/^l 



3M| 



,2inP-(i/2)/M2 



In our applications, Kn ~ 1, tp^t and Af2 = 1- Also, Mp/n^^ "^ in (1.4) 
is smaller than the term next to it. So the probability is roughly bounded 

by e-*'/2. 

The main tool used in proving Theorems 1.1 and 1.2 are the Chen-Stein 
Poisson approximation method and probabilities of moderate deviations by 
Amosova [1] and Rubin and Sethuraman [21]. They are listed in the Ap- 
pendix. 

In traditional random matrix theories, eigenvalues are the primary con- 
cern. See, for example, [18] and [5]. This paper together with [12], in which 
the maxima of entries of certain Haar-distributed matrices were studied for 
an imaging analysis problem, suggests that the study of entries of matrices 
are also important. 

Now we state the outline of this paper. A couple of lemmas are given 
in Section 2 for the preparation of the proofs of main results. We prove all 
main results in Section 3. In the last section some known results used in the 
proofs of our theorems are listed. 

2. Auxiliary lemmas. Three lemmas are needed before we go to the proof 
of main results. The proof of the following relies on Theorem 1.3, which will 
be proved at the end. There is no circular reasoning. 



Lemma 2.1. Let {^k^Vk^v'k' k = 1,2,. . . ,n} be i.i.d. random, variables 
with mean and variance 1. Let {ti„; n > 1} be a sequence of positive 
numbers such that Un/y/nlogn -^ a £ (0, oo). // -El^il*^ < oo for some q > 
(a2 + l)(a2 + 2), then 



(2.1) 



P 



n 


>Un, 


>:^kv'k 


k=l 




k=i 



> Ur, 



0{n 



b-a^ 



as n^ OO for any b> 0. 



DISTRIBUTIONS OF THE LARGEST ENTRIES 



Proof. The two events in (2.1) are conditionally independent given 
^fc's. Denote by P^ and E^ such conditional probability and expectation, 
respectively. Then the probability in (2.1) is 

\ 2n 



(2.2) 
Set 



E 



P' 



Yl ^kVk 



fc=i 



> Ur, 



An{s) = \- 

n 



E\ik 



<S 



fc=i 

for s> 2 and Jg (0,1/2). Choose /3 G (0^ + 2, q/{a'^ + l)) and r = 0^ + 1. Let 
(i^ = |4|/3 _ E\^kf for fc = 1,2, . . . ,n. Then ^ICiT < 00. By the Chebyshev 
inequality and Lemma A.l, 



(2.3) P{An{Pr) = P{ 



EC. 
fc=i 



>n5]< {ndyE 



Ea 
fc=i 



:0(n^-^W) 



as n — > cxD, where /(r) = r/2 if r > 2, and f{r) = r — 1 ifl<r<2. Let 
{C^; 1 <k <n} be an independent copy of {C,k', 1 <k <n}. Then since, 
from (2.3), P(| Y2=i Cfcl < n5/2) > 1/2 for sufficiently large n, it follows that 



(2.4) P{ 



fc=l 



>n6\ <2P\ 



E(c.-cD 



fc=i 



>n5/2] =0(n"-«'')) 



by repeating (2.3). Given an integer j > 1, let v = nS/Aj. Then by Lemma 
A. 2, there are positive constants Cj and Dj such that 



P\ 



E^Cfc-cio 



fc=i 



>n5/2 



P\ 



E(Cfc -Ck, 



k=l 



> 2jv 



<C,p(m^xjCk-Ck\>v)+DjP{ 



k=l 



>V 



Since ^iCiT < 00, P(maxi<fc<„ |a -C'k\>v)< nP(|Ci - ([{ > v) = ©(ni^^- 
By the same argument as the equality in (2.4), we obtain 



P 



E^Cfc-Cfc, 



k=l 



>V 



0{n 



Take j = [{r - l)//(r)] + 1. It follows that 



(2.5) 



P 



T.(Ck-c'k) 

k=l 



-^jfir)} 



>nS/2] =0{n'^') 



6 T. JIANG 

as n — > oo. Combining (2.3) and (2.4) with (2.5), we obtain that 

as n^ oo. By the same arguments the above stih holds if /? is replaced by 
2. Consequently, 

\ 2n 



E 



(2.6) 



Pi 



k=l 



>Un 



<E 



P' 



^^kVk 



k=l 



>Un] I, 



A„{2)nAn{/3) 



+ Oin'-''). 



Now we apply Theorem 1.3 to the last probability in (2.6). Note E^{^kVk) = 
and E'^\^kVk\' = \^k\'{E\£,i\'') for any s > 0. In particular, E^{^kVkf = ^l- 
Thus 



Ms=l(^j:\^k\')-m\'- 



Sol-5<M2<l + d on A„(2). Since 6 £ (0, 1/2), 1/2 < M2 < 2 on A„(2). 
Moreover, M^ < (1 + £;|^i|^)£;|^i|^ < 00 on A„(/3). Choose t = Un/y/n and 
p G ((a2 + 2)/(2/3), 1/2). It is easy to verify that d := -p{l - /?) - (1/2) > 0, 



\t-tp\< 



n" 



and Kn < exp(2t n 



3„p-l/2„4tnP-V2^ 



on A„{2) n ^„(/3) for each n > 1. Then there is a constant C > 0, such that 
the probability in (2.6) under the restriction A„(2) n An{(3) is less than 

for n sufficiently large, where the fact pf3 > 1 + (a^/2) is used. Note that 
0{7i^~^') = 0{n~°' ) since r = 1 + 0?. So the left-hand side of (2.6), hence, 
the probability in (2.1) is 0{n~°' /(^+'')) by (2.2). The desired conclusion 
then follows by choosing 6 small enough. D 



For any square matrix A = (ajj), define |||^||| = maxi<j^j<„ |oij|; that is, 
the maximum of the absolute values of the off-diagonal entries of A. 

Lemma 2.2. Recall xi in (1.3). Let hi = \\xi — Xi\\/ ^/n for each i. Then 
\\\nRn - X^X„||| < {bj + 2h)Wnb^^ + nb^^l 
where 



61 = max \hi — ll, 

l<i<n 

63 = min hi, 

l<i<n 



W„. 



l<i<j<n 



oa = max \Xi\. 

l<i<n 
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Proof. As in (1.3), the (i,j)-entry of i?„ is 

( rp . rp . 1 -^ ( rf . rp . \ rp-'- rp . iv-j rp . rp . 



Pij 



J/^ J/^'U^l 7 f of t'if 4 



The (i, j)-entry of X„X„ is xj Xj. So 

\npij -xjxjl < \{hihj)~^ - 1| • \xjxj\ + n—^ -^. 

Taking maximum for both sides, we obtain 

|||ni?„, — X„X„||| 

< max |(/i,7ii)~ — ll • max Ix,- xA +n[ max — -^ 

l<j<jr<n ■' l<i<j<n \l<i<n hi 

Write 1 - (hih^y^ = {hihj)-^{{hi - l){hj - 1) + {h - 1) + {hj - 1)). Then 
the desired inequahty foUows. D 

Next we estimate 6j's. 

Lemma 2.3. Suppose that {i^,Xjj,i,j = 1,2,. . .} are i.i.d. random vari- 
ables with E^ = and Var(^) = 1. Suppose also n/p ^ 7 G (0, cxo). If E\^\'^i^^~"-> < 00 
for some a € (0, 1/2), then 

n°6i — > a.s., 63 — > 1 a.s. and n'^b^^O a.s. 

as n —> 00. 

Proof. The second hmit follows from the first one. Easily, ||xj — Xjp = 
xfxi — nl^jp. Using the fact that jx — 1| < |x^ — 1| for any x > 0, we have 
that 



(2.7) n"6i < max 

l<i<n 



T _ 



n"' max Ix,- 



l<i<n 

Note x^Xj = J2k=i^'ki- By Lemma A.S the first and the second maxima 
above go to zero when E\^\^/^^~'^' < 00. So the first limit is proved. Under 
the condition that E\^\'^'^^~°'' < 00, the limit that ii'^b^ — > a.s. is proved 
by noting the relationship between ^"64 and the right most term in (2.7). 

D 

The analysis of Wn is given in the next section. 
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3. Proof of main results. Recall the definition of pij in (1.1) and (1.3). 
To prove Theorems 1.1 and 1.2, we assume throughout this section, without 
loss of generality, that 

{.^,Xij;z,j = 1,2, ...} are i.i.d. with E',^ = and Var(,^) = 1. 

The proofs of Theorems 1.1 and 1.2 rely on an analysis of the covariance 
matrix XjX„. The (z,j)-entry oi X'^Xn is J2k=i^kiXkj- Recall 



(3.1) Wn = max 

l<i<j<n 



k=l 



as in Lemma 2.2. The first step in proving our main theorems is approx- 
imating Rn by X^Xn as shown in Lemmas 2.2 and 2.3. The second step 
is deriving the corresponding results for X'^Xn- We actually will prove the 
following two lemmas. 

Lemma 3.1. Suppose that E\^\^^~^ < oo for any e > 0. If n/p — > 
7 E (0, oo), then: 

(i) lim sup " < 2 a.s. 

n^oo V n log n 

(ii) lim inf > 2 a.s. 

n-»oo -^nlogn 



Lemma 3.1 actually says that Wn/\^nlogn — > 2 a.s. as n — > oo. The reason 
we did not combine (i) and (ii) as a single limit is that the proof of the 
combined one is relatively long. We will prove the two parts separately. 

Lemma 3.2. Suppose that £'|^p°+^ < oo for some e > 0. If n/p ^ 7 G 
(0, 00), then 

2./3r\-i 



as n —> 00 for any y € M, where an = 4n log n — n log(log n) and K = (7 vSvr 

Assuming Lemmas 3.1 and 3.2, we next prove Theorems 1.1 and 1.2. The 
proof of the former two lemmas are given later. 

Proof of Theorems 1.1 and 1.2. Choose a = 1/3. Under the condi- 
tion that E\^\^ < 00, we have from the triangle inequality. Lemmas 2.2 and 
2.3 that 

(3.2) \nLn - Wn\ < InRn - ^J^n||| < 4n-^/^Wn + 2n^/^ a.s. 
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as n is sufficiently large. Applying Lemma 3.1, it follows that An^^'^Wn = 
0{n^'^ log7i) almost surely. Hence nLn — Wn = 0{n^'^) a.s. Theorem 1.1 
then follows immediately from Lemma 3.1. Now Theorem 1.1 and Lemma 
3.1 imply that nLn + Wn = 0{\^nlogn). Consequently, 

nLl-^^ = -{nLn - W^n)(nL„ + Wn) 
n n 

= 0(n~^/6 (log n)i/2) ^g^ 
Theorem 1.2 then follows from Lemma 3.2. D 

Now we turn to prove Lemmas 3.1 and 3.2. 



Proof of Lemma 3.1(i). Given 5 G (0,1), let u;„ = (2 + 5)yJn\ogn. 

Define y\j := J2k=i^kiXkj, i,jj > 1- Then ylj is a sum of / i.i.d. random 
variables with mean zero and variance one. By Lemma A. 3, under the con- 
dition that £'1^1'^ < oo for some d > 2 + (2 + 6)'^, 

(3.3) ,,f,f<..^(l4^l>-') = 0(^) 
as / is large, where we also use the fact that 

(3.4) 1 - Hx) = -^ r e-*^/^ dt ~ -^e-^y^ 



as x — > -|-cx) (see, e.g., page 49 from [7]). Review the expression of Wn in 
(3.1). For any integer m > 4/6, 



(3.5) 



where 



max Wi < max I max |y-- 

n'"</<(n+l)'" l<j^j<(n+l)'" V"'"<'<{"+1)'" "^ 

< max \yl- ' +r„, 

l<i^j<(n+l)™ -^ 



(3.0) rn= max max y,V ~ ?/ii 

By (3.3), 



P I max 

^l<«7^i<('^+l)'" 



yt;^\>wnr^l < (n + l)2-P(|yif )| >u;„™) 



= 0(n^'5™). 
Since X]n^~ < °^' by the Borel-Cantelli lemma, 

I C"")! 

(3.7) hmsup , ^, . ^ ' <2 + 5 a.s. 

n^oo ^Jrv^ iog{rr^) 
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Now let us estimate r„ as in (3.6). 

Let {zi,Z2,...} be i.i.d. random variables with the same law as xuxu 
with partial sums Sq = and Sk = J2i=i ^i- Clearly, Ezi = and Ezf = 1. 
Observe that the distribution of y^- — y^- is equal to that of Si-n-m for all 
/ > n"\ Thus, 

P(r„>(5Vn'^log(n'")) 

(3.8) <(n+l)2™pf max \Sk\ > (5Vn"*log(n'") 

\l<k<{n+l)-^-n"^ 

< 2{n + l)2™P(|5(„+i)™_„™| > (V2)Vn™log(n-)) 

as n is sufficiently large, where Ottaviani's inequality (see Exercise 16 on 
page 74 in [7]) is used in the last inequality. Set kn = {n+ 1)"^ — n*". Note 
that, for fixed m and 5, ((5/2)-\/n'"log(n™) > (2 + S)^/knlogkn as n is suf- 
ficiently large. By (3.4) and Lemma A. 3, the last probability in (3.8) is 
equal to 0(exp(— (m — 1)(2 + 5)^(logn)/2)) provided -E|^|'' < oo for some 
d>2 + {2 + 5f. Therefore, 

P(r„ > 5Vn'"log(n"^)) = ©(n""), 

where u = {m — 1)(2 + 5)^/2 — 2m > 1, since 7n is chosen such that m > 4/5. 
By the Borel-Cantelli lemma again, 

(3.9) limsup — - < 6 a.s. 

n^oo •\/n™log(n'") 

By (3.5), (3.7) and (3.9), we obtain that 

max„m<;<(„+i)m W; ^„ , „„ 
lim sup == < 2 + 2o a.s. 

n->oo •\/?T,'"log(n™) 

for any sufficiently small 6 > 0. This implies inequality (i) in Lemma 3.1. 

D 

Proof of Lemma 3.1(ii). We continue to use the notations in the 
proof of (i) of Lemma 3.1. For any 6 £ (0, 1), define Vn = {2 — 6)\/n\ogn. We 
first claim that 

(3.10) P{Wn<Vn) = 0(^ 

\n° 

as n — > oo for some positive constant 6' depending on 5 and the distribu- 
tion of S, only. If this is true, take an integer m such that m > 1/5' . Then 
P{Wnrn < Vn--) = Oil/n^'"^). Since En^""^'™ < oo, by the Borel-Cantelh 
lemma, we have that 

Wr,m 

(3.11) liminf^^ >2-6 a.s. 

n^oo ^n™log(n'") 
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for any 6 G (0, 1). Recalling the definition of r„ in (3.6), we have that 

inf Wk > Wn^ - rn- 

n'"<fc<(n+l)™ 

By (3.9) and (3.11), we have that 

n^oo ^n"^log(n'^) 

for any 6 small enough. This implies (ii) of Lemma 3.1. 

Now we turn to prove claim (3.10) by Lemma A. 4. 

Take / = {{i,j); I <i < j <p}- For a = {i,j) € I, set B^ = {{k,!) £ I; 
one of k and I = i oi j, but (A;, /) / a}, rja = Wij \, t = Vn and Aa = Aij = 
ilVij \ > ''^n}- By Lemma A. 4, 

(3.12) P{Wn<Vn)<e^^-+bi,n + b2,n. 

Evidently 

A„, = ^P(A,.), 
(3.13) 

bi,n<2p^P{Ai2f and 62,™ < 2p3p(^i2^i3). 

(n) 

Remember that y\2 is a sum of i.i.d. random variables with mean and 
variance 1. Recall (3.4). By Lemma A. 3, 

(3.14) P{Au) ^ ^ 



{2 - S)^/2TTlogn n(2-'5)V2 



(")| ^ ., L,(")| 



as n ^ 00 provided E\rj\° < 00. Note that ^(^12^13) = -P(|yi2 I > ""n, I2/13 | > 
Vn) and Vn/Vn\ogn^2 - S. By Lemma 2.1, ^(^12^13) = 0(n^"(2"'')^) for 
any 6 > provided ^^l^l" < 00 for some q>{{2- 6f + 1)((2 - 6)^ + 2) < 30. 
Choosing both b and 5 small enough, we obtain 

(3.15) e-^"<e-"*, 6i,„ < ^ and 62,™ < ^ 



for sufficiently large re. Then (3.10) follows from (3.12) and (3.15). D 

Proof of Lemma 3.2. We need to show that 

(3.16) PI max \yiA < ^Jan + rey I ^ e'^"" " , 

Vi<«<j<p / 

where yij = X]fc=i ^ki^kj- Now we apply Lemma A. 4 to prove (3.16). Take / = 

{(*> j); 1 < ^ < i < p}- For a = {i,j) G /, set X^ = \yij\ and Ba = {{k, I) £ /; one of A; and / : 
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i or j, but {k,l) / a}. Choose t = ^Un + ny. We first calculate A = A^i in 
the theorem. Since {yij\ (i, j) G /} are identically distributed, 

l<i<j<p 

(3.17) - - 



P -pp ny^>i^+y 



2 \ -y/n V IT' 

Observe that yi2 is a sum of i.i.d. random variables with mean and variance 
1. Since \/{an/n) + y ^ 2^/]ogn as n — > oo, it follows from Lemma A. 3 that 

(3.18) = pfe > V(a„/n) +y) + ^(t^ ^ V(an/n) + y 

~ — ^^n 



/27r 

provided £'|^|'' < cxd for some q > c^ + 2 = Q. Thus 

e-J//2 
(3.19) A, 



Obviously, Xa is independent of {X^;/5 G I\Ba} for any a = (i,j) G /. To 
complete (3.16), by Lemma A. 4, we have to verify that 6i — > and 62 — > 
as n — > c«. It is easy to check that the size of B^ is less than 2p. Thus 

1/2 „^ /n,_\ n/^lyisl ^ /an , ^_\^ ^/'l 



^I<5(p^-P)^(2p)-P^>jf + . =0 - 



by (3.18). Also, by symmetry, 

(3.20) 62 <p(/ -p)-P(|yi2| > Voin + ny, lyisl > Von + ny). 

Here ^a^ + ny/\/n log n — > 2. By Lemma 2.1, the above probability is 0{n^~^) 
for any 6 > 0, provided £;|^|'' < 00 for some g > (2^ + 1)(22 + 2) = 30. Now 
choose 6 < 1, then 62 -^ 0. By Lemma A. 4, (3.16) is concluded. D 

Proof of Theorem 1.3. Define 

fii = ViI{\Vi\<n^)-E7iJ{\7i^\<7^P) 
for p > 0. It is easy to see that 
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where r„ = (l/i/n) Y^^=i -^^i-^(b?il > "'^) because Erji = 0. Clearly, by Markov's 
inequality, 






^P(|r/i|>n'')<— ^ and |r„| < 



^p^_l — l'"l-„p(/3-l)-(l/2)' 

Thus, to prove the lemma, it suffices to show that 



(3.21) pll^Y^f,^>A <Kne-'Vm-). 

By the Chebyshev inequality and independence, we obtain that 

/ 1 " \ " 

(3.22) P\^ym>tp\<e-^'^\\Ee^^^/'^ 

for any 6 > 0. Since e^ <l + x + {x^/2) + (|xp/6)el^l for any x G R, 

(3.23) Ee'^^/^ < 1 + ^Efjf + Jl^E^^ eM0\m/V^))- 

Obviously, Efjf < Ei]f and |7?i| < 2n''. It follows that 

E{%\'eMd\m\/^)) < 2nV^"'"*'^'^i?|7?,|2. 
Since 1 + x <e^ for any x G M, by (3.23), we have that 

Substituting this back to (3.22), we obtain 

for any 6 > 0. Choosing 9 = tp/M2, it follows that 



since tp <t and t > 0, where -fC„ is as in the statement of Theorem 1.3. Then 
(3.21) follows. The proof is complete. D 
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APPENDIX 

For the proofs of the main theorems we quote some results from hterature 
in this section. 

The fohowing is a corohary of the Marcinkiewicz-Zygmund inequahty, 
see, for example, Corollary 2 on page 368 for p>2 and Theorem 2 on page 
367 for pG [1,2) from [7]. 

Lemma A.l. // {rjn,n > 1} are i.i.d. random variables with Ei]i = 0, 
E\rji\'P < oo , p > 1 , and Sn = J27=i Vi- Then, 



' "' ~\0{n), ^fl<p 



ifl<p<2. 

The following is Lemma 2.2 from [17], which is a useful version of the 
maximal inequality of Hoffmann- J0gensen, see [9] or Proposition 6.7 from 

[16]. 

Lemma A. 2. Let {%,1 < k < n} be independent symmetric random 
variables and Sn = J2k=i %• Then, for each integer j > 1, there exist positive 
numbers Cj and Dj depending only on j such that for all t > 0, 



P{\Sn\> 2jt) < CjPim^xJrijl >t\+ Dj{P{\Sn\ >t)y. 

The following lemma is from [1]. It is a refinement of Theorem 2 from 
[21]. See also page 254 from [20]. 

Lemma A. 3. Let {?7i,r/2, . . .} be a sequence of independent random vari- 
ables with Erji = and E-qf = af > for all i. Define Sn = Yll=i Vi ^'^^ 
Bn = J2'i=i'^i- Suppose 

liminf< — ^ > > and limsup< — y^E'|r7j|'^ > < oo 
n [ n } n [nf^^ J 

for some q>2. Let ^{x) = J^^{2TT)-'^/^e-^^/'^ dt. Then 

P{Sn/VK>x) ^ 
l-^{x) ~^ 

uniformly on [0, Cy/logn] for any c £ (0, ^/q^^) as n —> cxd. 

The following Poisson approximation result is essentially a special case of 
Theorem 1 in [3], which is again a special case of the general Chen-Stein 
Poisson approximation method. One application of the following lemma is 
studying behaviors of maxima of random variables. See, for example, [10] 
and [11]. 



DISTRIBUTIONS OF THE LARGEST ENTRIES 15 

Lemma A. 4. Let I he an index set and {B^^a G /} he a set of subsets of 
I, that is, Ba C / for each a £ I. Let also {r]a,Oi G /} he random variables. 
For a given t G M, set A = J2aei Pi'Ha > t). Then 



P ( max 77a < t I — e 



<(lAA-i)(6i+62 + &3), 



where 



a&ipeBc 

b2 = J2 J2 P{7]o,>t,7]f3>t), 

ael ay^f3eBa 

b3 = J2 E\PiVa > t\a{r]p,Pi B^)) - P{r^a. > t)\, 

and a{r]j3,f3 ^ Ba) is the a-algebra generated by {'r]i3,P ^ -Bq}- In particular, 
if Tja is independent of {rji^, (3 ^ B^} for each a, then 63 = 0. 

The following is Lemma 2 from [4]. 

Lemma A. 5. Let{v,Vij, i,j = l,2,...} be a double array of i.i.d. random 
variables and let a > 1/2, /? > and M > be constants. Then as n^ 00, 



max 

l<j<Mnf 



if and only if E\v\^^~^^''°' < 00. 



n 



a.s. 



Acknowledgments. The author thanks Iain Johnstone very much for sug- 
gesting the study of the largest entry of the covariance matrix in the begin- 
ning of Section 3. The author also thanks an anonymous referee for reading 
the manuscript carefully and pointing out a relevant paper. 

REFERENCES 

[1] Amosova, N. N. (1972). On limit theorems for the probabilities of moderate devia- 
tions. Vestnik Leningrad. Univ. 13 5-14. MR331484 

[2] Anderson, T. W. (1984). An Introduction to Multivariate Statistical Analysis, 2nd 
ed. Wiley, New York. MR771294 

[3] Arratia, R., Goldstein, L. and Gordon, L. (1989). Two moments suffice for Pois- 
son approximation: The Chen-Stein method. Ann. Probab. 17 9-25. MR972770 

[4] Bai, Z. D. (1993). Limit of the smallest eigenvalue of a large dimensional sample 
covariance matrix. Ann. Probab. 21 1275-1294. MR1235416 

[5] Bai, Z. D. (1999). Methodologies in spectral analysis of large dimensional random 
matrices, a review. Statist. Sinica 9 611-677. MR1711663 



16 T. JIANG 

[6] Barbour, A. and Eagleson, G., (1984). Poisson convergence for dissociated statis- 
tics. J. R. Stat. Soc. Ser. B Stat. Methodol. 46 397-402. MR790624 
[7] Chow, Y. S. and Teicher, H. (1988). Probability Theory, Independence, Inter- 
changeability, Martingales, 2nd ed. Springer, New York. MR953964 
[8] Dembo, a. and Zeitouni, O. (1998). Large Deviations Techniques and Applications, 

2nd ed. Springer, New York. MR1619036 
[9] Hoffman- J0RGENSEN, J. (1974). Sums of independent Banach space valued random 

variables. Studia Math. 52 159-186. MR356155 
[10] Jiang, T. (2002). Maxima of partial sums indexed by geometrical structures. Ann. 

Probab. 30 1854-1892. MR1944008 
[11] Jiang, T. (2002). A comparison of scores of two protein structures with foldings. 

Ann. Probab. 30 1893-1912. MR1944009 
[12] Jiang, T. (2002). Maxima of entries of Haar distributed matrices. Preprint. 

MR1944008 
[13] Jiang, T. (2002). The limiting distributions of eigenvalues of sample correlation 

matrices. Preprint. 
[14] Johnstone, I. (2001). On the distribution of the largest eigenvalue in principal 

components analysis. Ann. Statist. 29 295-327. MR1863961 
[15] Ledoux, M. (1992). On moderate deviations of sums of i.i.d. vector random variables. 

Ann. Inst. H. Pomcare Probab. Statist. 28 267-280. MR1162575 
[16] Ledoux, M. and Talagrand, M. (1992). Isoperimetry and Processes in the Theory 

of Probability in a Banach Space. Springer, New York. MR1102015 
[17] Li, D., Rao, M., Jiang, T. and Wang, X. (1995). Complete convergence and almost 

sure convergence of weighted sums of random variables. J. Theoret. Probab. 8 

754-789. MR1308670 
[18] Mheta, M. L. (1991). Random Matrices, 2nd ed. Academic Press, Boston. 

MR1083764 
[19] Nagaev, S. V. (1979). Large deviations of sums of independent random variables. 

Ann. Probab. 7 745-789. MR542129 
[20] Petrov, V. V. (1975). Sums of Independent Random Variables. Springer, New York. 

MR388499 
[21] Rubin, H. and Sethuraman, J. (1965). Probabilities of moderate deviations. 

Sankhyd Ser. A, 325-346. MR203783 

School of Statistics 

University of Minnesota 

313 Ford Hall 

224 Church Street S.E. 

Minneapolis, Minnesota 55455 

USA 

E-MAIL: tjiang@stat.umn.cdu 



